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Abstract. On etudie la loi des plus grandes valeurs propres de matrices aleatoires symetriques 
reelles et de covariance empirique quand les coefficients des matrices sont a queue lourde. On 
etend le resultat obtenu par A. Soshnikov dans |18| et on montre que le comportement asympto- 
tique des plus grandes valeurs propres est determine par les plus grandes entrees de la matrice. 

Abstract. We study the statistics of the largest eigenvalues of real symmetric and sample 
covariance matrices when the entries are heavy tailed. Extending the result obtained by A. 
Soshnikov in [TH], we prove that, in the absence of the fourth moment, the asymptotic behavior 
of the top eigenvalues is determined by the behavior of the largest entries of the matrix. 

1. Introduction and Notation 

We study the statistics of the largest eigenvalues of symmetric and sample covariance matrices 
when the entries are heavy tailed. Extending the result obtained by Soshnikov in [18J, we prove 
that in the absence of a finite fourth moment, the asymptotic behavior of the top eigenvalues 
is determined by the behavior of the largest entries of the matrix, i.e that the point process of 
the largest eigenvalues (properly normalized) converges to a Poisson Point Process, as in the 
usual extreme value theory for i.i.d. random variables. This result was predicted in the physics 
literature by Biroli, Bouchaud and Potters [BJ. 

We first consider the case of random real symmetric matrices with independent and heavy 
tailed entries. Let (cty), 1 < i < n, 1 < j < n be i.i.d random variables such that: 

1 _ F{x) = F(x) = Pfloy | >x) = L(x)x~ a , (1) 

where a > and L is a slowly varying function, i.e., for all t > 

.. L(tx) 
lim TFT = L 

x^oo L(X) 

Consider the n x n real symmetric random matrix A n whose entries above the diagonal are 
the (dij), 1 < i < j < n. Hypothesis ([T]) would be natural in the theory of extreme values for i.i.d 
random variables. It simply asserts that the distribution of the entries is in the max-domain of 
attraction of the Frechet distribution with exponent a (see |13| . page 54). Thus, for any a > 
the point process of extreme values of the entries of A n (properly normalized) is asymptotically 
Poissonian. More precisely, let 



then the Point Process 



b n = mf{x : 1 - F(x) < 2 }, (2) 

n(n + 1) 

Vn= Yl 

l<i<j'<n 
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converges to a Poisson Point Process with intensity 



a 



It is also classical that there exists another slowly varying function L Q such that 

b n ~L (n)nv. (3) 

When L = 1, then b n = (Hi^2)V<\ 

We denote by Ai > . . . > X n the n (real) eigenvalues of A n and we consider the point process 
on (0,oo) of (normalized) positive eigenvalues of A n : 

Theorem 1. We assume (CP with < a < 4. For 2 < a < 4 we also assume that the entries are 
centered, i.e E(ajj) = 0. The random point process V n converges in distribution to the Poisson 
Point Process V defined on (0, 00) with intensity p{x) = pr^- 

This result thus shows that the largest eigenvalues of A n behave as the largest entries of the 
matrix A n when < a < 4. It was proved in the range < a < 2 by Soshnikov [18] . It implies 
for instance that the maximum eigenvalue has a Frechet limit distribution: 

Corollary 1. 

lim P(t^Ai <x) = exp(-x~ a ). (4) 

One word of comment is in order here. When the entries have light tails, it is well-known 
that the random field of largest eigenvalues is not Poissonian but determinantal, and that the 
fluctuations of the top eigenvalue are asymptotically distributed as in the GOE, i.e have a Tracy- 
Widom distribution [16] . We actually believe that the universal Tracy- Widom picture holds as 
soon as a > 4, see [6] for a discussion and simulation. Some steps in this direction have been 
achieved by A. Ruzmaikina [2], who proves that the Tracy- Widom limit holds for a large enough. 
(She claimed that a > 18 is enough, we believe that the arguments of [2] only work if a > 36, 
see Remark [3]) 

Let us first consider the case a > 2. It is well known that, if 



fJ>n 

denotes the spectral measure of -7=, fj, n converges weakly almost surely to a non-random limit, 
the semi-circle law, 

, , f \\\f4o- 2 - x 2 , if \x\ <2a 

V (X) = { ,11- ^ 

I 0, otherwise. 

which depends only on the variance a 2 of the entries. In the case where a > 4, Bai and Yin pQ 
have proved that the top eigenvalue sticks to the bulk, i.e that for all e > 0, 

P(|-=Ai-2o-|<e)-»l , n^oo. 

Jn 



This shows that for a > 4, 6 n 1 Ai — > 00 so that our result in Corollary [T] ceases to be true. Our 
result shows that ^Ai is, roughly speaking, of order n 2 /" -1 / 2 and thus diverges. This is in 
agreement with and sharpens Bai and Yin's result, who have shown that a finite fourth moment 
is necessary to have the convergence to the edge of the bulk. 

The case a = 4 with infinite fourth moment seems very interesting and still open. It might 
exhibit an interesting transition between the Tracy- Widom regime and the Poissonian one. 



Coming back now to the case a < 2, the situation is different. The bulk itself is not a semi- 
circle. It was recently proved by Ben Arous-Guionnet [3] that the spectral measure 

/i n = re" 1 y^Sxj, 

where 

c n = infix : 1 - F(x) < -} 

n 

converges to a limiting distribution a a . This limit probability distribution \i a is not compactly 
supported and has a polynomial tail of type i°„ dx for some constant C a . In this case (a < 2), 
this is perfectly compatible with the present result: the extreme values of iid random variables 
with that distribution [i a would have exactly the behavior we have given. 

We also study in this paper the behavior of the top of the spectrum for another very important 
family of random matrices, i.e the ensemble of large random sample covariance matrices. 

In this setting, we consider A n a re x p random matrix with i.i.d centered entries (ay), 1 < i < 
n, 1 < j < p and we define as usual the sample covariance matrix X n = -A n A l n . The asymptotic 
behavior of the bulk of X n is also well-known by the classical result of Marchenko-Pastur [11], in 
the case where we assume a finite second moment. The case where < a < 2 is treated in [7j. 

Similarly to (j2J), let 

b np = infix : 1 — Fix) < — }. 

np 

If Ai > . . . > A n are the ordered eigenvalues of A n A^ n and lim n ^ 00 ^ = 7 for some positive 
constant 7 > 1 defining 

V n = <5 6 -2 A . 

i 

we have the following: 

Theorem 2. We assume |2]j with < a < 4. For 2 < a < 4, we also assume that the entries 
are centered, i.e, E(ajj) = 0. The random point process V n converges in distribution, as p goes 
to infinity, to the Poisson Point Process V defined on (0, 00) with intensity p(x) = 2xl + a / 2 ■ 

Again, as a simple corollary, we obtain the behavior of the maximal eigenvalue: 

Corollary 2. 

1 

lim P(-pj-Ai < x) = exp(-x a). (7) 

The rest of the paper is organized as follows. First, in section [5] we recall briefly the main 
results contained in [18], i.e, the proof of Theorem [1] in the case < a < 2. We then prove 
Theorem [2] in the case < a < 2 in Section [33 We then study in section [J] and [5] the case where 
2 < a < 4 for the Wigner and the sample covariance matrix cases respectively. This is a bit 
different in nature since we now have to perform a more subtle separation of scales. This is done 
through an estimate of traces of high powers of our random matrices properly truncated. This 
combinatorial part of the proof draws on the work of Soshnikov [16] and on the recent work by 
Peche-Soshnikov [12] . 

2. WlGNER MATRICES WHEN < a < 2. 

In this section, we will recall the results in the paper of Soshnikov [p5] (see also |17j for precise 
statement of Lemma [3J . 

Let A n be re x re random (real) symmetric matrix with iid entries satisfying ([T]) with < a < 2. 
Also let Ai > A2 > A3 > . . . > A n be its eigenvalues and a-i l j l its Z-th largest entry in absolute 
value. 



In order to prove Theorem Q] in this case, Soshnikov [18] proceeds as follows. The basic idea is 
to show that for each finite k, and for each given e > 0, 

P(| — ^- - 1| > e) -> as n^oo. (8) 

a ikjk 

We first consider the case where k = 1, which implies corollary [TJ The following crucial lemma 
is purely probabilistic. It describes how the largest entries are placed in the matrix. This lemma 
will be adapted in all other sections. 

Lemma 3. (a) With probability going to one, there are no diagonal entries greater in absolute 
value than bn 1 ^ 20 . 

(b) With probability going to one, there is no pair (i,j) such that |ay| > b n 9 ^ 100 and \au\ + 
I I ^ h 1 / 10 

| CLjj | S> On 

(c) For any positive 5 > with probability going to one there is no row that has at least two 

entries greater in absolute value than bf/ 4 + 5. 

(d) With probability going to one, there is no row such that its maximum and the sum of the 
absolute value of the remaining elements in the row are both greater than b n ^ +a ^ 8 . 

Once one has proved the previous lemma, the next step is to relate the entries of the matrix 
with its maximum eigenvalue Ai. This can be done in two steps. 

First, one can bound from below the top eigenvalue using the Rayleigh-Ritz representation of 
Ai: 

Ai = sup (A n v,v) . (9) 

v:\v\=l 

Considering a well-chosen vector v in terms of the position of the largest entry of A n , will 
provide the inequality 

Ai > 0^(1 + 0(1)). 

Secondly, one studies the following norm of the matrix A n , that is the norm of A n as a linear 
operator from l^ to l^: 

n 

||A n ||oo = m&x) \aij\. (10) 

i ' — * 

i=i 

Since A n is symmetric, we can show that H^Hoo is an upper bound for Ax- Lemma [3] then 
relates ||-^4.n||oo 

to the maximum entry of the matrix A n . In fact, Soshnikov showed that, given 
e > 0, there exists 9 > such that for n sufficiently large, one has: 



I A I 

I - t1L n 1 1 co , 



> e ) < nexp(— n e ). 



mflrX^ j I (x% j I 
This proves ([8]) when k = 1. 

Now, for any finite k, using Lemma El it is possible to find some well-chosen unit vectors Vk, 
such that, 

A n v k = a ikjk v k +r k , and ||r fc || = o(l). 

This fact together with a standard result in pertubation theory of symmetric matrices (see for 
instance [1] page 77) imply that A n has eigenvalues 0,^(1 + o(l)), 1 < I < k, for any finite k. 

To finally get Theorem [T] we use induction on k in ([S]), supported by Corollary [T] and the 
following very classical result about symmetric matrices, that can be found for instance in [3], 
page 59. 



Proposition 4 (Cauchy Interlacing Theorem). Let A n be an n x n Symmetric matrix and X\ > 
A2 > • • • > A n its ordered (real) eigenvalues. If one considers the restriction B of A n to any 
subspace of co-dimension 1 and denotes by fi± > fi2 > ■ ■ ■ > fJ> n -i the eigenvalues of B then 

Ai > Mi > A 2 > a*2 > • • • > (J-n-i > A n . 

We briefly explain how to use the last statement in this setting. For simplicity, instead of 
describing the general step of the induction, we describe only the step k = 2. First, one considers 
the submatrix obtained by removing the i%— th row and the j±— th column from A n . Clearly, this 
submatrix has Oj 2 j 2 as largest entry in absolute value, and by the interlacing property its largest 
eigenvalue will be greater than A 2 - Thus, one can apply Corollary [T] to finally get (|8|) for k = 2. 

To get Theorem Q] as stated, i.e, to prove tightness and the convergence of V n to V, it suffices 
to prove, see |10j Theorem 16.16, that for all intervals (a, b), where < a < b one has that the 
random variable V n (a,b) converges in distribution to V(a,b). This can be verified as follows. 

Since 

F(V Tl (a,oo) > fc)=P(^±i >a ), 

it is easy to see that proves the convergence in distribution of V n when restricted to an interval 
(a, 00), a > 0, i.e the random variable V n (a, 00) converges in distribution to V(a, 00) which is a 
Poisson Process with parameter a~ a . 

To derive the result for a general interval, we first note that since 

nn{b}) > 0) = 0, 

we can consider intervals (a, b] which can be written as the difference of I a = (a, 00) and 1^ = 
(6, 00). Now, 

00 

F(P n (a,b] =1) = ^>CP n (a,oo) = k,V n (b,oo) = k-l). (11) 
k=i 

Each term inside the sum converges to P("P(a, 00) = k, V(b, 00) = k — I) and is also bounded 
by F(V n (a, 00) = k). Since the sum 

00 

a, 00) = k) 

k=i 

is finite, an application of Fatou's Lemma shows that (jlip converges to 

00 

F(V(a,b] =1) =^P(P(a,oo) = k,V(b,oo) = k-l). 

k=l 

The details omitted here will be considered in the next sections, specially the next one, where 
we follow the same strategy in the case of Sample Covariance matrices with < a < 2. 

3. Sample Covariance matrices when < a < 2. 

The proof of Theorem [2] in this case is based on the following lemma, which is almost identical 
to Lemma [3] given in last section. 

Lemma 5. Let A n be a n x p random matrix with i.i.d. entries a^j, l<i<n, l<j<p 
satisfying {!]). Also assume that linin^oo ^ = 7 for some constant 7 > 1. Then: 

(a) If B^ is the event 'There is a row with 2 entries greater than b & np in absolute value' then 

V<5 > 3/4, lim P(B* ) = 0. 



(b) Also, 

v 

lira P(3i, 1 < i < n, max loj,- > 6n» 8 and > I a» I — max la^ l > bnv 8 ) = 0. 

n^oo l<j<p 1<J<P 

(c) Similarly, 



3 i as 



lim P(3j, 1 < 7 < p, max lawl > bn V 8 and |aj 9 -| — max |oj,-| > 64, 8 ) = 0. 

ra— »oc l<j<n J z — ' J l<j<n 

3.1. Proof of Lemma [5l 

Proof of (a). First, a basic fact from slowly varying functions that will be used repeatedly in this 
paper (see for instance [5], chapter I). Given any S > 0, one has that 

aT 5 < L(x) <^x s as x -> do. (12) 

We recall that /(x) <C g(x) means that the ratio tends to as x tends to infinity. Hence, 
using (fl2|) and choosing 5 = | + e, 

P(5* p ) = P(3i < n, 3j, fc, j ^ fc, S .t.| % - 1 > ^and^l > b% 4+e ) < p 2 n(l - F(b%^)f 



< p 2 nL(b%^' 



n P ' ,3«/2+2e« 
Onp 

= o(n-^ e ) 

for a small enough 9, since for any e > 0, b^ 2+2ea 3> n 3 by the definition of b np . 

Proof of (b). We split the proof in two cases. The idea in both cases is the same and the 
computation almost identical. 

We start by assuming that 1 < a < 2. 

Let T g N be such that 1/(2T + 1) < 1/4 - a/8. 

Proposition 6. Assume that 1 < a < 2. There exists > such that, for n sufficiently large, 

i \ 



1 3 



T+l ~ 



> 1 — nexp(— n e ). 



Proof. In order to prove the last proposition we introduce K° = p, and for k > 1: 



if = #(1 <j <p: |ay| >b 

so that 



2T+1 \ 



E 1 ^Kii<E^ +1 - as) 

Lemma 7. Letk<T. There exists 9>0 such that F(Yf > 2EYf) < exp(-/). 



Proof of Lemma 7. By definition of K , and setting by convention b%p +1 = 0, for all 1 < i < n 
and 0<fc<T, fcGNwe have 

WY*=pF{b^). 
Also, using Chernoff 's inequality, we have that 

p(y/ > vmy?) < ex P (--Ei; fe ). (14) 



Thus, we can argue as follows: 

First, replace the value KY^ in ()14j) and use the expression for F(x) given in JT]). Hence, by 
([12]) there exists e sufficiently small such that 



P(5? > 2EY?) < exp(--pF(b^ +l )) < exp(- -pF '{b^ )) 



T To 



< exp(--pL(&^ +1 ) < exp(-p/46, 



2T+1 t 2T+1 ' 
np "rip 



T (a + e) fl 

< exp(-p/4(np) 2T + 1 « ) < exp(-jr), 



where the last inequality is justified by the hypothesis lim^^oo ^ = 7. 
Lemma 8. Assume that 1 < a < 2. Then, 

T 

4 



, S+l 1 3 , a 

E»^ +1 < I M P +8 

fe=0 



Proof of Lemma 8. One has that 
/ \ 



E 



v':Kj|<fe 



T+l 



fc+1 



J 



k=0 



k=0 



fc+1 -fcq 



fc(l + <S-q) 
2T+1 

'np 



fc=0 fc=0 

where 5 > can be chosen such that 1 + 5 — a < 0. Therefore, 



/ 



E 



\ 



E 



T+l 

\j':|aij|<bnp +1 / 



1 00 fc(l + ,5-q) 

k=0 



1_ 1 .3 _|_ Q. 

i "np _i ^ u np 



□ 



(15) 



(16) 
□ 



Combining the last two lemmas with (|13p . we get proposition 6. 



□ 



T+l 

Now we turn back to the sum of all terms which have absolute value between bnp +1 and 
frnp 4+Q//16 - This sum is easier to handle since we have fewer entries. To simplify a little bit the 



notation below, put fi = 2T+1 • 

Proposition 9. There exists k > such that 



\ — ^ 1 — +— ^ 

3i 'EV^ ^ i^i+W^' 1 - 2 6 ™ p 8 ' ^ ex p(- nK )- 

V-Kp<\aij\<b£ p "'} A 



Proof. 



V {rKp<\aij\<bnp } 1 J 1 



< n 



J np) 

at 



u np 



< exp(-n K ), (17) 
for some sufficiently small k > since ^ > 1/2. □ 

Let us finish the proof of statement (b) when 1 < a < 2. By part (a) we know that, with 
probability going to 1 as p goes to infinity, there is at most one term in each line that exceeds 

3 _i_ a 

bnp 16 in absolute value. So it is enough to consider the sum of all entries less than or equal to 

bnp 16 and prove that in fact the probability that this sum is less than b np 8 goes to one as p tends 
to infinity. We proved this statement in two parts, analyzed in proposition [6] and proposition [9] 
respectively. 

Case < a < I: 

We repeat the same argument and computation used in the other case. We begin by proving 
the counterpart of Proposition [6] in this case. 

Proposition 10. There exists > such that for n large enough 

1 3_j_o: \ 

/I T+\ \a>ij\ < -bnp 8 , > 1 - nexp(-n e ). 

T 0':KI<6S qT } 2 J 

Proof. Lemma [7] is valid when < a < 2 so that it is enough to prove 
Lemma 11. Let < a < 1 and T such that 2 t+i ^ § • ^ en > 

f>^|^<±4 +f - (18) 



k=0 



Proof. One has that 



fc+l - 1 fc k+l -kg 



^Y?bn% +1 < £ 2pF{bn% +1 )bn% +1 < 2 P ]T )blV' b% +1 
k=0 k=0 k=0 

1 k(l+S-g) i a I 3 a 

< 2nb 2T+1 S^h 2T+1 < b 2 T < -b 1 J 



fc=0 

□ 

Using Proposition [10] and Proposition [9] as before it is easy to prove statement (b) of Lemma 

El □ 

Proof of (c): As one can easily see, the proof of item (c) of the lemma is identical to the proof 
of part (b) up to a permutation of p's and n's. □ 



Remark 1. Recalling definition (jlOp . statement (b) of Lemma Proposition [6] and Proposition 
[10] show that for every e > there exists n Q (e) and 9 > such that for all n > n Q one has 

I ^4r>. 1 1 no -.I \- / fl\ 

- 1 1 > e I < nexp(— n J. 



max,-, a 



By part (c) of LemmaO the same is valid if one replaces ||^4 n ||oo above by ||^4n||i = sup.,- Ya=i \ a ij\- 

Remark 2. From now on, if X n and Y n are two sequences of random variables defined on the 
same probability space, we will use the notation X n = Y n {\ + o(l)) to indicate that for all e > 0, 
the probability Pdy 11 — 1| > e) goes to as n goes to infinity, i.e the ratio & converges in 
probability to 1. 

3.2. Proof of Theorem El 

Proof of Corollary PJ1 We begin by the proof of Corollary [2j The main thing to show is that 

A - — 1, (19) 



in probability as n tends to infinity (we recall that |). In fact, if we assume (fT9|h 
extreme value theory for iid random variables tell us that 

a 2 ■ 

lim P(-^- < x) = exp(-x"t), (20) 

so (fT9]l and (f20l) will imply Corollary [2j Thus, our task is to prove (fT9|) and the idea is as follows: 
Given e > 0, we want to show that for n sufficiently large we have 

Ai>af m (l + o(l)), (21) 



Ai<af m (l + o(l)), (22) 

with probability greater than 1 — e. The main tool used to prove both equations will be Lemma 
El and we will start with the easiest inequality, (f2"Tl) . 

Since for all unit vectors v we have the bound (X n v,v) < Ai, our task is the following: we 
must find a suitable vector that gives us OTl) . Therefore, let be the position of ai 1 j 1 in A n 

as the notation suggests. If one takes v = (0, . . . , 0, 1, 0, . . . , 0) where the sole non-zero entry of 
v is in the position ii, the vector v will do the job. In fact, 

v p 

{X n v,v) = ^af u - = <4ii+ a2 iii = a iiji^ + °( 1 ))' 

by part (b) of Lemma [5] and the fact that aj Ul is the maximum of np iid random variables. This 
proves ([2T]) . 

To obtain ()22[) we first recall the definition of 

n 

\\X n \\oo = SUp^ \Xjj\. 
i . , 



The eigenvector equation for Ai 



implies that 



Yl Xi i v i = A| '' ; 

3=1 



so, 



AiKi < yji-XjjiKi < sup 1^1 y] \Xjj\ 



Ai sup \v,i\ < I sup \vi\ ] sup^^ \X. 

i \ I J i . 1 



V I 

Therefore, H^Hoo is an upper bound for Ai. 
Hence, with probability going to one, 

{n p | | p n 

KfciKfci > < sup < ^2 \ a ik\ ^2 
j=l k=l ) i [k=l j=l 

< sup < > j \a ik \ } sup < > j \aji\ > < || 

* \tl J ' \fe J 

< af m (l + o(l)), 

where the last inequality comes from Remark [TJ □ 



Proof of Theorem It is enough to show that for any finite k we have for all 1 < I < k 

lim P(tJ-A, < x) = lim P(t2-oL. < x), (23) 

where a^^ is the I— th term of the sequence |<Zy| in the decreasing order. 

Let e\, . . . , e p be the standard orthonormal basis of W. If we compute X n .ei t we get: 

n 

x n£ii = y2 Xii i ei = Xi in ei i + r; ' ( 24 ) 
1=1 

for some vector r; in R n . 

Also, since AT n is symmetric, one can find a orthogonal matrix £/ and a diagonal matrix 
such that X n = UDU" 1 . Now, suppose that Xi x i x is not an eigenvalue of D. Then D — X^I is 
invertible and one can use equation (12*1]) to get: 

1 = \\ eil \\ = \\U(D - Xin^U-Wl < \\U\\ WU' 1 || || (D - X ilH I)- l \\\\nl (25) 

which implies 

min|Aj - X kk \ < \\ri\\, (26) 

i 

so there exists an eigenvalue A of X n such that 

|A-X Mt | < Ihll- (27) 

If is an eigenvalue of D — X^I, ([27j) is clearly satisfied. 

We now know, by Lemma [5j that X^ = af • (1 + o(l)). Therefore, if we manage to prove 
that 77 has a norm that is negligible with respect to of^, we will be able to say that Af n has 
eigenvalues ^^(1 + o(l)), 1 < I < k for any finite k. 

Bounding the norm of r% , one gets 




k=l 

We cannot estimate Si directly as we did in part (a) but if we define 



$2 — X \ a nk\ X \ a ik\i (28) 
k=l,k^ji i=l,i^i, 



then 



5*2 < sup ^2 \a ik \ X \a kk \, (29) 

\ k i=l,i^ii J k=l,k^=ji 

which tells us that S% is negligible with respect to ct? - , again by Lemma Now, 

n 

Si — S2 = X \ a iik\- (30) 

k=l,k^ii 

which is also negligible with respect to af^ since X)fc=i k^j t \ a kji \ 1S negligible by a direct appli- 
cation of part (b) of Lemma EJ Hence, Si is also negligible with respect to «f i;?1 - 

We now know that af U; (l+o(l)), 1 < I < k are eigenvalues of X n . However, this does not imply 
that they are exactly the k top eigenvalues. At the moment this is true only for the maximum, 
by Corollary ([2j), and we need to check it for 1 < I < k. In other words, what we get from the 
last statement is that for all 1 < I < k, for p large enough: 

A,>o? M .(l + o(l)), (31) 

and we need to prove the reverse inequality. To achieve our goal, we consider the compression of 
the matrix X n step by step, i.e., we cut from A n the row ii and from the column %i and then 

we compute their product. By part (a) of Lemma the entry aj 2j2 is still in the matrix and the 

(2) 

product Xn is just the matrix X n without the row and column ii. Now we know by the Cauchy 
Interlacing Theorem, see [1], Corollary 3.1.5, that if ki > . . . > the eigenvalues of X„ , 

we have: 

-^l > K i > A2 > ^2 > . . . > Kp-i > A p . (32) 

Combining (|32|) with Corollary [2] applied for the matrix Xn , we get the desired inequality for 
A2. Repeating the same argument for all 1 < I < k, the proof is complete. 

□ 

4. WlGNER MATRICES WHEN 2 < a < 4 

Now we consider the symmetric random matrix (£%)™., = i where the entries of A n are centered 
i.i.d. and satisfy ([T]) with 2 < a < 4. We treat separately the case where a = 2. 



4.1. Truncation. The main difference between the proof of this section to the previous one is 
that here we should care about the contribution given by the bulk of the spectra, that is we 
should control in some way the smaller entries of the matrix A n and then proceed as before. 
Thus, to investigate the behavior of the largest eigenvalue Ai, we split the above random matrix 
as follows. Let (5 be such that 

1 „ 2(8 -a) 
a a(10 — a) 

and we define 

A l = {Ai3^\an\<vfi)i^=l, M = A n - A x . (34) 

Since (3 < — , it is also clear that with probability going to 1, the largest entry of A n is the 
largest entry of A2. The condition that j3 > 1/a is assumed to guarantee that we can study the 
asymptotic behaviour of the eigenvalues of A<i in a similar way of the previous section. On the 
other hand, the condition that (3 < is assumed to guarantee that the spectrum of Ai, 

properly normalized, remains bounded. 

4.2. Bounding the spectrum of A\, We first investigate the behavior of the largest eigenvalue 
of A\ referring the reader to the results of [2] and pQ . These papers deal with the case of random 
Wigner matrices with the presence of a finite fourth moment and they prove boundedness of the 
spectra. Fix some e > such that 

Here, we will prove that the largest eigenvalue and the smallest eigenvalue of 2 /i-e -^i are 
bounded on a set of probability arbitrarily close to 1. In this direction, our main result in 
this subsection will be: 

Proposition 12. Let s n be some sequence going to infinity in such a way that logn << s n « n 1 
where < 7 < min{|(^ — 1 — /3(5 — ^)), — j}. Then there exists a constant C > such that 

Before giving the proof, we indicate how to use Proposition [12] to deduce the desired result, 
that is the boundedness of the largest eigenvalue of 2/ >* _ e At- We have 



M-^-7^i)>4a < 



£(Ai(-37^i) 2s " 



~ n 2/a-e " ~ J ~ (4fj) 2s " 

A 

< 



(4cr) 2s " 
< exp(-r?s n ) 

for some constant r\ > 0, proving that Ai( ^ 2/ 4_ e A\) is bounded in probability. By symmetry, one 
gets the same result for the smallest eigenvalue. 

Proof of Proposition To estimate E ^Tr( ^ 2 ^_ e ) 2Sn ^j, we use the moment method. Developing 
the expectation, we have that 

E (Tt(j4i) n ) = Eajojj fij^ 6^213 aj 3 j 4 aj 4 j 5 . . . Q>i 2gn _ 2 i 2sn _ 1 ai 2sn _ 1 i , (35) 

where = o-ij^-{\ ai \<n^} an d denotes the set of all closed paths P = {io,«i, • • • ,^2s„-l)*o} 
with a distinguished origin, in the set {1,2,..., n}. 

The following two lemmas are a direct consequence of [8j, chapter VIIL9, Theorem 2.23. 



Lemma 13. Let C n = E(tiy-) = M(aijl\ aij \ <n p). Then for n large enough, \C n \ < L{nP)n^ x Q ) 
where L is defined in (Op. 

Lemma 14. For any k>2, let Df = E(ag") = E( afj 5 -^| 0i .|< n ^)- Then for n large enough, there 
exists a slowly varying function Iq such that D 2k < lo(n^)n^^ 2k ~ a \ 

Thus it follows that 

|Ai(Ai - EAi) - Ai(Ai)| < \C n \n < L(n /J )n /3(1 - af)+1 , (36) 

so one can write 

Mr A n) ^ A i(^(^i - + Mit(a 2 )) + -^IAi^ - eao - a^oi, (37) 

where the last term tends to zero as n tends to infinity since 0(1 - a) + 1 - I < 0. Thus, wc 
may assume that A\ is also centered, i.e, the truncated variables centered. 

Now we move back to equation (|35|) . to compute the expected value of the trace of n ( 4 / c , 1 _ 2e ) a „ • 
The first step after the centering is to consider the contribution of even paths, i.e. paths such 
that each edge occurs an even number of times. 

We refer to the paper of A. Soshnikov [TU] for most of the details and further notation. To 
each path P = i Q — > i\ —* i<i . . . —* i2s n ^ 1 ^cb we nrs t associate a set of s n "marked instants" 
as follows. We read the edges of P successively. The instant at which an edge i — ► j is read is 
then said to be marked if up to that moment (inclusive) the edge was read an odd number 
of times. Other instants are said to be unmarked. Now, the number of possible arrangements 
of marked/unmarked instants in a path of length 2s n is equal to the number of Dick paths, i.e., 
the number of simple random walks of length 2s n , starting and ending at 0, and conditioned to 

remain in the positive quadrant. The number of Dyck paths is known to be the Catalan number: 

(2s n )\ 
s„!(s„+l)!' 

We say that a vertex is marked if it occurs at a marked instant. For any < k < s n , we then 
define 2V& to be the subset of vertices in {1, ... ,71} occurring k times as a marked vertex. Any 
vertex belonging to JVjfc is said to be a vertex of self-intersection of type k. For any < k < s n , 
we denote by the cardinality of Nk and call (n Q , n\, . . . , n Sn ) the type of P. Note that all the 
vertices that appear in P belong to Nk,k > 1 except possibly the origin i Q . 

Then, it is easy to see that {1, ... ,71} splits as the disjoint union of the sets N Q , N±, . . . , N Sn . 
From these definitions, one can see that 

nfc = n, and knk = s n . (38) 
fe=o fe=o 

To estimate the number of possible paths and their contribution to the expectation, we proceed 
as follows. We first determine the set of marked instants and the type of the path. Then, we 
assign labels chosen in {1, . . . , n} to each marked instant and to the origin of the path. Finally, 
we assign labels to each unmarked instant and consider the expectation of the corresponding 
path. 

Given the set of marked instants and the type of the path (n Q , m, ■ ■ ■ , n Sn ), one has exactly 
ris„ Sn /] rytn: ways to distribute the marked instants into the possible classes of self-intersection. 

The number of ways to distribute the vertices of {l,...,n} into the set of possible classes 
JV , N±, . . . , N s and determine the origin of the path is at most , „ — \ n - This is because 
the origin is in general a non-marked vertex. There now remains to give an upper bound on the 
number of ways to determine vertices at unmarked instants, that is fill in the blanks of the path. 
It was proved in [19] that the number of ways to assign labels at unmarked instants is not greater 
than rifc=2(^) fcnfc - Indeed, the number of possible ways to determine the right endpoint of an 
edge starting from a vertex of type k at an unmarked instant is at most 2k. 



To consider the expectation of a path P of type (n ,m, . . . ,n Sn ), we will need the following 
Lemma. 

Lemma 15. Consider an even path of type (no,ni, . . . ,n Sn ). One has 

v {„. a a ,, a a- a- ■) < a 2Sn TT (l f^y^M^-l))^™ 1 ( o q) 

V «0«l"'«l*2 u '«2«3 U '«3*4 tt «4«5 • • • U *2s n -2«2s„-l u '«2s„-l«0y — U L X V J " J V 0t V 

Proof. Assume that a non-oriented edge (ij) is seen 2l{ij) times. We denote by (resp. 

the number of times i (resp. j) is a marked vertex in (ij). We also set L(ij) = 
xaax{l(i;ij),l(j;ij)} and L'(ij) = mm{l(i;ij),l(j;ij)}. 
First, using Lemma HU we deduce that 

U mm >^ J) < II Un^m-o) 

(ij):l(ij)>l 

< 11 / (^)n^ 2L fe') +2 ^+ 2L 'fe')- 2+2 ^) (40) 

(ij):l(ij)>l 

Second, we change the product in (f4T)J) over all edges to a product over all vertices. In fact, 
one can associate to each edge occurring 4 times at least one marked occurrence of a vertex of 
self-intersection. To deal with vertices where L'(ij) = L(ij) = 1, we say that L'(ij) is associated 
to the vertex which has the smallest multiplicity in the path. Using the fact that the number of 
marked occurrences of any vertex in edges seen at least 4 times cannot exceed the type of the 
vertex and ([38]) . we get: 

u mm>1 ^ 3) < n ua^) 

(ij):l(ij)>l 

< n /3E W ) ;iW )> 1 (2Lfo-)+^)+/3E( lj);i ( lJ )>2(2L'fe')-2+^) "Q l o ^fn k 

k>2 

<n(^( n/3 ) in/3(2j " ( ^ 2)/2) ) ni - (4i) 

i>2 

□ 

Remark 3. Lemma [T5l plays the role of Formula 4.7 in [T3] to bound the contribution of a single 
path of type (no, n\,..., n Sn ). Here is where we cannot understand the arguments of [H]. Indeed, 
with the notation of [H], we believe that formula 4.7, 



V 

i 2s n — 1 \ .j % s n 

E II ^ + A^i-e n )W n <—i r ]J(2kC) k ^l[(2kAl)^ 

V u=0 / k=3 fc=| 



should be written as 

2s n -l 



u=0 / k=3 fc=| 



n/ ' 



since, as argued in Lemma \15\ an edge seen p times implies the occurrence of a marked vertex 
of type at least p/A but not necessarily of type p/2. Mutatis mutandis, the other arguments of 
Ruzmaikina carry out to show that the universal Tracy- Widom limit holds if a > 36. 

We now call Z e the contribution of all even paths of type (n ,n\, . . . ,n Sn ) to the expectation 

ETrf^^) 2 ^. Writing 

ETr I — rr^ — | = ETr ' 1 1 ' 



n 2/a-e I ' \Jn J \ n 2 / Q - 1 /2-e 



we deduce that Z e is bounded by: 



Z P < 



(2a n )! 



s I 
°n ■ 



(V«-2 6 ) no!n x ! . . . n s J s n l(s n + 1)! \\ s k n =2 {k\)^ 



< 



n ' 

xct 2s " j j s /;;/? 

fc=2 
1 



JJ(2fc) 



fc=2 



no 



kM2k-{a/2- 



ii k 



< 



n (4/a-l-2e)E l > 1 m ! 
S<rc 

X(7 2S « JJ ^J n /3(2fc-(a/2-l))j 

(2a n )! 



n...(n + l) (2s n )! 
n 



a s„-ni 



S n!(^ + l)!re=2«fc!re= 2 (^) ,_, 



JI(2fc) 



fc=2 



n (4/a-2e-l) E 4 >a *"i s n !(s n + 1)! [^=2 ™fc ! Ul=2( ke 



k=2 



k=2 



< 



(2s n ] 



2s, 



s n \{s n + l)\ 



no 



k=2 Uk - 



lk s k n P(2k-(a/2-l)) 



n 



Ak/a-2kt-l 



Uk 



1 



ni(4/a-2e-l) ' 



(42) 



First, we consider the contribution of simple paths, that is paths of type (n Q . m, 0, . . . , 0). We 
denote Z e s this contribution. Then 



Z e .s < 



1 



3n(4/a-l-2e) ' 



J? 



s n \{s n + l)\ 

This follows from the fact that in simple even paths, any edge is seen twice and the choice of the 
origin and marked vertices determines the path. 

We next turn to the contribution of paths with self-intersections, which we denote by Z e ^. Now, 
if we take the sum over all non- negative integers n^, «3 , • • ■ n Sn such that 



/^2 n k > 0, 



(43) 



k=2 



we have that: 



Z e _i < 



-no 



2-S j"; 



s n \{s n + l)\ 

(2«„)! 



e n 



i 

n fc ! 



7fc_fc„ y 9(2fc-(a/2-l)) 



n2,...,n Sn fc=2 

Now as e < 2/a — /? and s n n 2 ^ << n 4//a , we deduce that 



n 



Ak/a-2ke-l 



iik 



(44) 



< 



s n \{s n + l)\ 

<c (2s " )! 



no 



2*, 



^ /If! 



Ca 2 n /3(4-(a/2-l)) 



-ner 



M! n 

Af>0 



8/a-4e-l 



A/ 



s n !(s n + l)!"" n 8 /"" 4 - 1 
< c4^1 T7 n ff 2s "6?/( 5 - a / 2 ) +1 - 8 / a+4e 



s n !(s„ + l)! 

v J s n \( Sn + iy. 

In the last line, we have used the fact that s n « n 



(45) 



(1-1-/3(5- 1))/8 



. Now, we will see that this is also true for a path such that an edge occurs an odd number of 
times, proving proposition 1121 The necessary tools are the gluing and the insertion procedures 
developed in |12| . We refer to this article for details and notation. 

In [12] . one can prove that given a path P of length 2s n with 21 non-returned edges, it is 
possible to construct a sequence (Po, Pi, P2, ■ ■ ■ , Pj), 1 < J < 21, of subpaths of P such that 
the concatenated path W = \J i=Q Pi, i.e. the path defined as if we read Pi in order, has length 
2s n — 21 and belongs to one of the following classes: 

A W is a closed even path. 

B W is a sequence of I < 2 closed even paths where each origin is a marked vertex of P. 
C W is a sequence of / < 2 paths where each origin is a marked vertex of P and the union 
of these paths has only even edges. 

The surgery in P consists only to remove the last occurrence of odd edges and reorder the 
remaining subpaths with the possibility of choosing the direction in which each subpath is read. 
To estimate the contribution of odd paths, one can reverse the above procedure, defining an onto 
map from paths of classes A, B, C to the set of odd paths. This was done in [12] and we concisely 
describe the method. 

In case A, the simplest one, given a closed path W of length 2s n — 21 one needs only to choose 
J vertices to split W, choose the order and direction of each subpath and how to assign and 
insert the 21 unreturned edges. Also, since a\j are bounded by vP ', adding these 21 repetitions we 
multiply the contribution of the original path at most by n 2l @. Briefly, the contribution of paths 
with 21 odd edges such that W(P) £ A can be bounded by: 



Cl 2{Sn „ l)T (2s n -2l\ j(2l\ (2s n -2l)l (^_ } 

Cl ( Sn -i) Ksn -i + iy na j y- 1 {j) ( 2s n -Ai + j)\{ n y^l ' (4bj 



which can be bounded by: 



2/ 



This is less than or equal to: 



(2»„ - 21)! 

%„- i)!0 ,„- i+1) r ■ <48) 

The summation over I of the above gives a contribution which is negligible with respect to 
(2a) 2Sn nsn 3/2 since s 2 n < ni~ e_/3 . 

Also, [12] gives us the following estimate for the contribution of odd paths coming from the 
class B: 



Again, proceeding as before there is a constant K > such that (I49|) divided by the contribution 
of even paths can be bounded by 

s w — 1 

^ (Ks n nP-^f, (50) 
l=i 

which tends to as n goes to infinity. As shown in [p2], with a little bit of effort, the counting 
in case C can be reduced to the one in case A or B, which finishes the proof of the proposition. 



Remark 4 (Case a = 2). When a = 2, we do not use Proposition [12] as it is written to bound 
the largest eigenvalue of the truncated matrix. In fact, since one knows that, see [8], 

f(x) = E^-lflayKa.}) 

is a slowly varying function, we can show that for any < 5 < e 



^Ai(^)>4)-0, („-oo). 



□ 



4.3. The largest eigenvalue of A 2 . In this subsection we will prove that the largest eigenvalue 
of A2 is actually asymptotically given by its largest entry in absolute value. For short, we denote 
by Aij, i,j = 1, . . . , n the entries of the matrix A2 and by A{ 1 j 1 the largest one in absolute value. 
The aim of this subsection is to prove: 

Proposition 16. One has that for any e > 

P(|^^)-l|>e) — 0. as n^oo. 

Proposition [TC] will be enough to end the proof of the Corollary [JJ as we will explain now. First, 
we point that since the matrices that we are dealing are symmetric, \\A\\ = max{|Ai(-A)|, |A n (-<4)|} 
is exactly the operator norm of the matrix. Then triangular inequality implies: 

^maxij-^) + A m j n ( — A\) < \i( — A n ) < A max ( — j4i) + Xmaxij- ^2) ■ (51) 
On On n n n 

Now, since " 2 ^ a goes to as n goes to infinity, the boundedness in probability of the largest 
eigenvalue of 2 /i-e -^i as proved in the last subsection implies that A max (^-^4i) goes to in 
probability. Then, by proposition [TBI we have that the largest eigenvalue of A n behave just as 
the largest eigenvalue of A2, i.e, 

lim F(X max (—A n ) <x)= lim F(X max (—A 2 ) < x) = lim P(^i < x ) = eX p(-x- Q ). (52) 

Proof of Proposition 1 1 61 The proof of Proposition [16] relics on the two following lemmas, repeat- 
ing the arguments of Lemma EJ 

Lemma 17. Let e > be fixed. With probability going to one, one has that 

(a) There are no diagonal entry An greater in absolute value than bl/ 2+t . 

(b) There is no pair (i,j) such that \Aij\ > & 99//10 ° and \An\ + \Ajj\ > bl/ 10 . 

(c) For any 5 > 0, there is no row that has two entries greater than bn^ +S ■ 

Proof, (a) follows from basics results about extremes of n independent variables. For (b), one 
recalls ([T2]) to compute 

P(3(U)||4-| > 6r i0 °and \A U \ + > < 2^F{b^)F{bT) 

99 1 -109a 

2 i 



< n 2 L{b^)L{bh°){b n 

18 , a 

= o(n 100 " l " 17 ) 



for some 9 > small enough. 



Similarly, another application of (|12j) yields 

P(3i < n, 3j, k<n,j ^ k, 8.t.\Aij\ > b 3 n /4+5 and \ A ik \ > bf/ 4+s ) < n 3 F{b 3 J A+s )F{b 3 J i+s ) 

< n 3 L(b 3 n / A+5 " 2 ' 

= o(n- 45+d ) 



y3a/2+2Sa 



for some 8 > small enough. In the last line we have used that for any 5 > 0, ^ x / 2+2&a ^> n 3_ [— | 

We now show that the largest entry Ai 1 j 1 determines the largest eigenvalue of A 2 . The idea is 
similar to the first part with minor changes. Introduce the vector 

vi = ^|( e ii ± e ii)> 

where the sign ± is determined by the following rule: ie^ = +ej 1 if Ai 1 j 1 > and otherwise. 
Then, with probability going to one, 

(A 2 v 1 ,vi) = -A ilix + -A jlh + \A ilh \ = 11^1(1 + 0(1)), 

in view of item (b) of the preceding lemma and the fact that Aj 1 j 1 is the largest entry of the 
matrix A 2 . Thus, again with probability going to one, 

\max{A 2 ) > 11^1(1 + 0(1)). 

We now turn to the upper bound which follows from the following Lemma. 
Lemma 18. One has that 

n „ 
- + — * ^ ~ „ 3_|__a. 

Jim ¥(3i, 1 < i < n, max \Aij\ > bn 16 and > \Aij\ - max. \Aij\ > bi 16 ) = 0. 



i<i<". 



i<i<« 



Proof. Recall that by ([33]) we have aj3 > 1. 
Now, a trivial union bound gives us that 



3i, 



E 



\ j--nP<\A l3 \<bl + ™ 



, 1 | + 



< nP #{j : 7/ < I Ail < $ 32 } > 



/ 



< n 



< n 



l+bp(l-al3) 



< exp(-n K ) 



(53) 



for some sufficiently small k > 0. Thus, (|53p together with part (c) of Lemma [17] yelds Lemma 
[13 

□ 

Now, combining Lemma [18] with (c) of Lemma [T7] and repeating the same argument done in 
last section to prove part (b) in Lemma [5] Proposition [16] holds. □ 

Proof of TheoremUl when 2 < a < 4. By Proposition [T2| with probability going to 1, we know 

A% is bounded. Also, let Ai l j l be the I— th largest entry in absolute 



that the spectrum of , „. 
value of A 2 . If one sets 



vi 



where e^ ± 



ej ; + ej t if Aijj z > and e^ — ej l otherwise, then 

n 

A 2 vi = ^(Ai, ± Aij t )ei = \Aiiji\vi + 



i=i 



where 

n 

r = ^iiii e ii i Aj l j l ej l + (A^ ± Aij t )ei. 

By the same arguments that we used in Theorem [21 one can show that |p tends to as n tends 
to infinity. This implies an existence of an eigenvalue A of A 2 such that A = + o(l)). An 

application of Cauchy Interlacing Theorem just as before shows that in fact A = Xi(A 2 ), where 
Xi(A 2 ) represents the I— th eigenvalue of A2 in descending order. Since by Weyl's inequalities, 
one has for all I , 1 < I < n, 

\{A 2 ) + X n {A x ) < \i(Ax + A 2 ) < \{A 2 ) + Xi(Ai). (54) 

Dividing equation (|54|) by b 2 n and taking the limit one gets 

lim P(A,(^i + A 2 ) < b n x) = lim P(A,(A 2 ) < b n x) = lim ¥(a idl < b n x), 

n—>oo n— >oo n— >oo 

which ends the proof of (b). □ 

5. Sample Covapjance Matrices when 2 < a < 4. 

This section heavily uses the previous ones. We repeat the arguments of section using the 
results of section |H As before, the first step is to truncate our matrix. 
Define 

Ai = (Aijl^^ nl3 )ij, A 2 = A n - A\, (55) 

and also 

X 1 = A 1 A\, X 2 = X n -X 1 . (56) 

It is clear that X 2 = A\A\ + A 2 A\ + A 2 A\ and that the largest entries of A n belong to A 2 . We 
are going to study the eigenvalues of X 2 . The following lemma follows directly from the proof of 
Lemma [5] and Lemma [181 

Lemma 19. Let A 2 be defined as above. Also assume that p = ljn\ for some constant 7 > 1. 
Then: 

(a) If is the event 'There is a row with 2 entries greater than b & np in absolute value ' then 

M5 > 3/4, lim F(B s n ) = 0. 

(b) One has that 
lim ¥(3i, 1 < i < n, max |aj 3 - 

(c) One has that 
lim ¥(3j, 1 < j < p, max la™ 

p^oo l<i<n 

If one takes v = (0, . . . , 0, 1, 0, . . . , 0) where the only non-zero entry of v is in the position i\, 
{X 2 v, v) will give us the following bound: 

\max{X 2 )>a 2 ilh (l + o(l)). (57) 

This inequality is justified by the preceding lemma, just as in theorem [21 and by the fact that 
the diagonal of A\A\ + A 2 A\ has only zeros. More than that, as in section [3l we can also infer 
from Lemma [191 that 

\max{A 2 Ai) = a 2 hjl (l + o(l)). (58) 



P 



> bn P 16 and \aij\ — max \a,ij\ > bnp 16 ) = 0. 

j=i !<i<p 



1 p -i 

- + — . ^ ^ + -2L 

> bnp 16 and y. \ a ij\ ~ max \ a ij\ > bnp 16 ) = 0. 
j=i l <i<P 



Now, using Rayleigh-Ritz representation and linearity of the scalar product, we have that 
^max(X n ) = max (X n v, v) = max {{A\ + A 2 ) {A\ + A 2 )*v, v) 

v:\v\ = l v:\v\ = l 

= max UX\v, v) + (A 2 A\v, v) + {AxA^v, v) + (A 2 A[v, v)) 

v:\v\ = l 

which, by Cauchy- Schwartz, yelds 

Xrnax(X n ) < m&x(Xiv,v)+ max (A 2 A\v , v) + 2 max [|^4.* d[| max H-A^ll 

■y:|t)|=l d:|d|=1 i;:|u|=1 «:|ii|=1 

< Knax(X\) + \ rnax (A 2 A t 2 ) + 2 (\ rnax (Xi)\ max (A 2 A t 2 )) ^ . (59) 
In view of ([371) . ([351) . ([39]) and Weyl's inequality for A" n = Xi + X 2 , namely, 

it remains to show that the largest eigenvalue of X\ is negligible with respect to af^ to conclude 
that 

AmosC-Xn) = af xil (l + o(l)) 

and, therefore, finish the proof of Corollary [2J 

Thus, we turn back our attention to the truncated matrix X\ = A\A\. As we did before, we 
will show that its largest eigenvalue properly normalized remains bounded on a set of probability 
arbitrarily close to 1. Again, we study the asymptotics of some expectations of X\. 

By the results of the last section, we just need to control the expected value of some traces of 
the matrix X\. 

Let s n be as in section 14.21 One can write 

E(TrAT 1 n ) = ^ ^ Eai 1 i ai 1 i 2 ai i i 2 ai i j i . . . flj2s n -i«2s n -2 a i2s n -i«o- (^0) 

where & denotes the set of all closed paths P = {io, ii, . . . , i 2sn -i, io} with a distinguished 
origin, in the set {1, 2, . . . ,p} with the restriction it £ {1, 2, . . . , n} for odd t. Now, since ay 7^ dji 
orientation of an edge plays a role. 

We say that a path is odd if the number of passages in the direction i — > j plus the number 
of passages in the direction j — * i is odd for some % and j. P is even if it is not odd. Since 
we can center the random variables o>ij1-\ ai -\< n p just as in section [H odd paths with a non-zero 
contribution have at least 3 passages in one direction of an odd edge. We will prove that the 
contribution of odd paths to the sum (|60l) is negligible and that contribution of even paths can 
be easily bounded by the results on the Wigner case. To do so, one can proceed as follows. 

Construct apxp random symmetric matrix M = {yij)i<ij<p such that yij are independent 
identically distributed random variables with the same distribution as an. Hence, if we denote 
E(P) as the contribution of the path, 

E(-P) < E(TrM 2s "). (61) 

Pe.^,Peven 

In fact, one has a 1 — 1 relation between paths in the LHS and paths that give a non-zero 
contribution to the sum in the RHS. Furthermore, if an edge is read r times from left to 
right and s times in the opposite direction, then p + s = 2q for some integer q and we can use 
the inequality 

EafyEtfi < E\aij\ r E\aji\ s < EyJ 7 , (62) 

leading to ([5T]) . 

Therefore, proceeding as in section HI equation (fUT|) implies: 



Proposition 20. There exists a constant C > such that E (^Tr( n4 f a 1 _ 2e ) 2Sn )j < (2a) 



4s n Cn 
3/2 • 



Moreover, 



for some constant -q > 0. 

To prove that the contribution of odd paths is negligible with respect to (2a) 4s " -§7^, we can 

still bound the contribution of each path using the inequality ([62]) . Since to an odd path V, 
there corresponds an odd path in the expansion of ¥,(TrM 2Sn ), we analyze their contribution as 
in section HI 

£ E(P) « (2cx) 4 ^. (63) 

Pe^.Podd s n 

This ends the proof Corollary [2] in the case of 2 < a < 4. The proof of Theorem [2] in this case 
is identical to the proof of the case < a < 2. 
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